1 AGCTTTATAA CCATGTGATC CCATCTTATG GTTTCAATCC ATGCACAGGA 

51 GGAAAATTGT GGGCACGAAG TTTCCAAAGG GAAAATTTAT AGATTGGTAG 

101 TTAATGAAAT ACAGTTTTCC TCCTTGGCAA ATTTAATTTA CTAGCTTCAC 

151 TGTATAGGAA AAAGCAGGAA AAAAATTAAA ACCAACTCAC CTCCAAACCT 

201 GTTTTGAGCT TTTACTTGTC TGCCCAATTG ATAGTTTCTA CTCTCTGCTT 

251 TTGATGAAAA TATTTTTTAT TATTTTAATG TAACTTCTGA AAACTAAATT 

301 ATCTAGAAGC AAATAAAAAG ATATTGCTTT TATAGTTCCC AGAAGGAAAA 

351 AACAAACACT AGGAAAGTTC TATCTATCAG ATGGGGGAGA TGTGATGGAG 

401 GCAGTGATAT TTGAGCTGAG CCTT GAACAA TGAACAGGAG TCTACCAAGC 

4 51 GAGAGGCTAG CGGGTGGCCC TCAAGATAAA ACAACAGCAT GTACAAAGGC 

501 ATGGAGACAT ACACATCTTG ACTCTTCCAG GAATGGTGGG AACGCTGGTG 

551 GAGCTAGAAT GTAGGTACAT AGCATAAAGT GGCAGACGGG AAGCCTTTGG 

601 AAATCTTATT ACATAGGACC CTGGATGCCA TTCCAATGAC TTTGAATTTT 

651 CTGTAGGCTG CCAGCGAAAT TTCCAAGCGT GATAGAGTCA TGTCTATCTA 

7 01 TGCACTTCAG AAAGACAACC TCAGGGTTAA TGAAGAAAAT GCATTGGAAT 

751 ATAAGAAACT GGT GACCAGA GTGATCAATT GCATGACTGT TGTGAAAGTC 

801 CAGGTGAGGG GAGCTGTGGG CAAGGTCAGA GTTGAGAGGC AT T T CAGAGA 

851 TAAAAT GACA GTAACTAAGT AGATGTCAGG CTGAGAAGAA AGGGCTGTAC 

901 CAGATATATG GTGCTATCAT TAAGTGAGCT CAACATTGCA GAAAAGGGGT 

951 AGGTTTGGTG GGAGTTGCTC AC AAAACAT G TTTAGTCTAA G C AAAAC CAT 

1001 TGCCATGGGC TCAGATAAAA GTTAAGAAGT GGAAACCATT CCTACATTCC 

1051 TATAGGAGCT GCTATCTGGA AGGCCTAGTA TACACGTGGC TTTTCAGCTG 

1101 TGATTTTGTT TGATTTTAGG GATTATTCTT TTTCTGAATC TGAGCAATGT 
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1151 TAGCGTGTAA AATACTCACA 

1201 ATAAATCATA TTGAGTTTGT 

1251 GTCAGGTCAA CTCCATGTGA 

1301 AAAGGTTTCC TGGTACACTG 

1351 AGGTAAGTCT TTGACCCCAC 

14 01 T AAG T AC AAA TATGAGAAGG 

1451 CTCTGTCCAC TGCTTTCGTG 

1501 TTTTCCTTTT GAATTTACCA 

1551 GATTTAGGAG GTTTGCTTCT 

1601 AAAAATTCTC TCTTCAAACA 

1651 CAGGTCCTGT TGGGGGGTGG 

17 01 TCAATAGGGG CAGCAAACCA 

1751 CTGCACGTTC TGCACATGTA 

1801 AAAAAACCTT TTTTCTATTT 

1851 TAGAGAAGGA TAATTGTGCT 

1901 CTGACCTATT TTTAACAAAA 

1951 ATGCRGGGTA AGCAGTTCCT 

20 01 TCTTGAATTC CTCCTTTTAT 

2051 GATGAAAAGC TTATTATCAA 

2101 ATTTGTTACA GTAAAATGGG 

2151 CTAGCATGGC CATTTATTTA 

2201 GCATCTCTCC TGTGGTTCCA 

2251 G AC C GAG AAA CCCTGCCAAA 

2301 CGATGCAAAG GGGCTAGAAT 



CCCACAGCTT TGACTGGGTG AGAAGT TAT C 
TGTGATACCT TCAGCTTCAA CAAGTGATGA 
AAGTTCCTTG CTAAGCATGC AGATATTCTG 
GCTCATGGCA CAGAT AG GAG AAATTGAGGA 
CTGATAACAC CTAGTTTGAG TCAACCTGGT 
CTTCTCATTC AGGTCCATGC TTGCCTACTC 
AAGACAAGAT GAAGTTCACA GTGAGTAGAT 
CCAAATGATT GGAGACTGTC AATATTCTGA 
TATGGCCCCA TCATGGAAAG TTTGTTTTAA 
CATGGACACA GAGAGGGGAA CAACACACAC 
AGAGTGAGGG GAGGGAACTT AGAGGACAGG 
CCATGGCACA CATATACCTA TGTAACAAAC 
TCCCTTTTTT TTAGAAGAAG AAATAATGAA 
ATATAATCAT GGCATTTATA AGCATCTCTA 
GAGATTAGAC AGCTGTCTGA GCACCTCACA 
TGACTTTCCA CAT C AC CT GA TTTCGGCTCC 
AAGCCCTAGA AAGTGCCGAT CATCCCTCAT 
TTACCAAAAT TCCTGAGCAT GTTCAGGAAA 
AATAAGTGGC TGAGATAGAC TTCTTGTCAC 
TCTCCAAGAA AG AAAGAT T T GCCTTGGGCT 
AGAAAGCATC TGAAACATGA AGCTACCACA 
GACGGAAGCC TGAGAGTCTA GGAGGAGGTG 
GTAACTAGTA GTGCCGGGTT TCTCACAACA 
CAGATGACTA TTTTCATGTT T C AACAT ACT 
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2351 AC AC AC T G GA AAACGTTACG GCAGACTCTA CTTTATAATG GGGCTGCAAA 

2401 TGTAAAATGA CTACTAGAAC TAGGTCCTCT TAATAGCAGC AAAGTTTAAA 

2451 AGGGTCAGAG GGAGCTCCAG ACACAGGTTA GATTTGATTT CTCTCCTAGT 

2501 TCTGCTGTGA ACAAGAGGTA TAAGTTTGGC CAACTCACTT AACCCCT GAA 

2551 GCTCAGTTAC CTTATCTGTA AAATGATTGC ATTGTACTAG GTGTTCTCTA 

2 601 AAATTTCTTC TACCTCTGAC TTTTTAGGAG ACTAATTTTT AACTCCTTTT 

2651 TAAGCTATTG GGAGAAAAAT TTAATTTTTT TTCAAAAGTT ACCTTGAATC 

27 01 TCTAGAGCAG TTCTCAAAAC TATTTTGTCC CAGGCAAAGG AAATGAGACT 

27 51 AGGTACCCAG AATGAGGCAC CCTGCATAAA GCTCTGTGCT CTGAAAACCA 

28 01 ATGTCAGGGA CCCTGTGATA AATAATTAAA CCAAGTATCC TGGGACACTG 
2851 CTAGTGACAT CGCCTCTGCT GAT CACTCTT GCCAGCGAGA CACTCTATAC 
2901 TTGCTTTCTC ATCATTGGCA TCCAAACTGC CTACTAATCC ATTGCTTTGG 
2 951 AAAGTTTTTT TTAATAAAAA GATTATTTCT ATTAGGAGGA AAACATCCCA 
3001 TGTTAAATAG GAAAATTAAC TGAAATCATT TTCAGATGTG ATTTTTAGCA 
3051 CTTATAGCCA TTTCAAACCA TGGTATTCAT TTATACTATG CTATTTATTG 
3101 TAAAACTTCT TTTTTTTTCC AAGGAAAATA AGATAGTTTG CTTTATTTTA 
3151 AAACAGTAAC TTTCTTATAT TGGGGCACTG ACCAAAATTC AATACTGGTA 
3201 CAAATATGTT ACCTAGGGGG TCAAAATATG TGCCAGGTGA ATTTTCTGAA 
32 51 TTTCTCTAAA GAGAGAATTT T AAACCT TAT AAAACAATTA GAAAC AAGT G 
3301 AGTGAGAGGT GAGCATCAAC AACCTGTGTA ACATAAGCCA CAGTACAAAT 
3351 TTAAGCTGAA TAACCAAGCC ATGTCAGTTA TCCCAAATCA TTTTTGTTAA 
34 01 TATTTAGGAG GATACACATA TTTTCAATAA CTTAAAAGTG AATCTTTACT 
3451 CCTATCTCTT AATACTCGAA GAAGTATAAC TTTCTTCTTT TACTAGATTT 
3501 AAATAATCCA AATATCTACT CAAGGTAGGA TGCTGT CATT AACTATAGCT 
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3551 GAGTTTATCC AAAATAGAAA AAT CAT GAAG ATTTATAAAG CATTTTAAAA 

3 601 AT AAT C ATT T ATAGCAAGTC CTTGAAAGCT CTAAATAAGA AAGGCAGTTC 

3 651 TCTACTTTCT AATAACACCT ATGGTTTATA T T AC AT AAT A TAATTCAACA 

37 01 AAACAGCATT CTGACCAATG ATAATTTATA GGAAATTCAT TTGCCAAGTA 

37 51 TATGTTTTAT TATAAAGTTA ATATTTTGAC C AAT C T T AAA AATTTTTAAA 

38 01 CTCTATTCTG AC AT T T C C AG AAGTATTATC TTAGCAAGTC ATCTTTATGA 
38 51 TACCACTTAT TAAACTGAAG AGAAACAAGA TGGTACATTC TGGGTTTTAC 
3 901 TTTAAAAGGG ATTTGATTCA AT AAT T T GAT TTATCACTAC TTGAAAATTA 

3 951 CATTTTCTTC CTCAGACTGG ATGGCAATGA GAT GAAAGCA GCTTTCCTGG 
4001 CTCTCAACTT CCCTTCTTCA TCAATTTTTC CAGCGTTTCA TAAGGCCTAC 

4 051 ACTAAAAATT CTAAAACTAT ATATCACATT AATATAATTA CTTATAATTA 
4101 ATCAGCAATT TCACATTATC GTTAAAACCT TTATGGTTAA AAAATGCAAG 
4151 GTAAGAGAAG AAAAAAACAC ATTGAACTAG AACTGAACAC ATTGGTAAAA 
42 01 TTAGTGAATA C TT TT CAT AA GCTTGGATAG AGGAAGAAAG AAGACAT CAT 
4 251 TTTGCCATGT AAC AG GAG AC CAATGTTATT TGTGATTTCA GATTGTCTTT 
4301 GCTGGACTTC TTGGAGTCTT TCTAGCTCCT GCCCTAGCTA ACTATGTAAG 
4351 TCTCACCTTT TCAAGTTTGC TACCAAAATG CATTTGCAAG GAAATGTGAT 
4 4 01 ATTAAATCAC TCTCAATCTC TTATAAACTT CAGAATATCA ACGTCAATGA 
4451 TGACAACAAC AATGCTGGAA GTGGGCAGCA GTCAGTGAGT GTCAACAATG 
4501 AAC AC AAT GT GGCCAATGTT GACAATAACA ACGGATGGGA CTCCTGGAAT 
4551 TCCATCTGGG ATTATGGAAA TGTAGGTAGT CAACGTGCAA TTTTCACTTT 
4 601 ATTGTTTAAA AATACGACTT CTTTTTAACA AAAAATGTGC ATGTTAACCA 
4 651 TAAAGAAATT AAAAATAAAT TCTAATTACA CATAGCATAC AGTTATAAGT 
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4701 AAAGGTGACC ATTTTGCTCA TCCGATTTTG TTCCCTAGAG ATAACTACTG 

4751 TTAATAAGTG TTGCATGATC AGTTAAAATT CAAACCAACA AACACTATGT 

4 801 TCAAGGGATT GTGGGTATAT ACAACAAATA TGAACATCCT TTTGCCTTGC 

4851 CTGCAGATAC CCTCAATAAT GCTGAAAGAC TTATACAACA TTACTGCTTC 

4 901 CAAAGCTTAG ACTATCTCAC TTTGTTTTCA AAGGAGGTTT TACGACCTTC 

4 951 TAAAGAGATT GAAATTGACA TTTCACCTAA AACTCGGGAA ATGTAAATGA 

5001 CAATATTAAT TGGTAAGAGA GGAAAGAAGA AAGAAAGAAG GAAGGAAAGA 

5051 AAGAAAGAAG GAAGGAAGGA AAGAAAGAAA GAAAGAAAGA AAGAGAGAGA 

5101 AAGAAAGAAA AAGAAAAAAG AGAGAAAGAG AGAAGGAAAG AAAGAGAGAA 

5151 GGAAAGGAAA AGAGAAGCAA AGAAAGAGAG GAGCAAAGAA AGGAACACTT 

5201 AGCACTAGTT GGGAGACCCA ACTCTGGAAT TATCAGCTAT ATATTTAACA 

5251 AACGTTATAC TTTTAAATAG CAAACTCTTT ATTGTTTCAA TTTTATCTGG 

5301 TCAATTGGAA AAATAATTTT TGTCTTATCT GTCTCCTTGA AATGTGAGGA 

5351 TCAAAGGAGA CTAAAACATG ATAGCTTTTA AAGTCTATTT CAGTAAAACA 

54 01 GACTTATATA GAGGGGTTTT TATCATGCTG GAACCTGGAA ATAAAGCAAA 

54 51 CCAGTTAGAT GCTCAGTCTC TGCCCTCACA GAATTGCAGT CTGTCCCCAC 

5501 AAATGTCAGC AAT AG AT AT G ATTGCCAAGC AGTGCCCCAT CCAGTGCTCT 

5551 TATCCCAGCT CATCACGATC TTGGAGTTCC CATTTCTCTC TGCAGGTGGA 

5601 ACTGACCTCT GATAAGAAAA GCTCCTCGGA GAACACATGC CTCACTATTT 

5651 GCCATCTACT TTAACAGGGC TTTGCTGCAA CCAGACTCTT TCAAAAGAAG 

57 01 ACATGCATTG TGCACAAAAT GAACAAGGAA GTCATGCCCT CCATTCAATC 

57 51 CCTTGATGCA CTGGTCAAGG AAAAGAAGGT AAAAATAAAA GGCTTTTTAT 

58 01 TTTTGGTGAG GGGAGAGGTT TTACATCCTT CAGTAAATAA CGAGAAGATC 
5851 ACAGTCATTC CCTCTTGACT ACAGTATGTT GTAGTGTGCA GCACAAAGGG 
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5901 GGAAGTTATT GGTGATTGCC TGAGGGAAGG CAACTTCTGC CACATCAAAT 

5951 GCTGTGGCTC ACACCTACCT CTACAACCGC TGAGCAAAGC ACTTGAAACC 

6001 TTGACTGTTA GAGGAGCAAA GCTCTGGTCA CACCAATAGG AGCCTCAGTA 

6051 CTTTGCCAAG GACATTTTTC TGCAAGAGTT AGTTAGGGTT ATTAGATTTA 

6101 GCAAATGAAA ATAGAAGATA TCCAGTTAGG TTTGAATTTT AGGTAAGCAG 

6151 CAGGTCTTTT TAGTATAATA TATCCTATGC AATATTTGGG ATATACTAAA 

6201 AAAAGAT CCA TTGTTATCTG AAATTCAAAT GTAACT GGGT ATTGTATATT 

6251 TTGTCTGGCC ATACTAATCC AGGTGAGTGG AAAGAAGAGA TCCATAATGT 

6301 TTTAAAATAT TTGCCTGAGT TCATATTCCT ATAACTGATA AATGAGTACC 

6351 TTTCATTGAC AAGGTAGAGA AAATAAATAA ACTGCATTCT C AG AAG AT G A 

64 01 TTATTACATA GTCTAATCCA AGGAATCTAT GATGAC C AAA TGAGGTCCAA 

64 51 GTTGCAGAAT AAATTAAGCC TCAGACTTCT GTGTTTATGA GAAGCTGAGG 

6501 TTTCAAACCA GGTAAATCCC TTAGGACACT TAGAAATGCT AAGATATACA 

6551 GAATAAGCTA GAAATGGCTC TTCTTCATCT TGATTATGGA AAAATTTAGC 

6601 TGAGCAACAC TCACTGTTGG CCTCGTATAC CCCTCAAGTC AACAAACCAC 

6651 TGGGCTTGGC ATTCATTCTC TCCCATTCTT CCTTTCTACC TCTCTTTTCC 

67 01 ACACTCAGCT TCAGGGTAAG GGACCAGGAG GACCACCTCC CAAGGGCCTG 

6751 ATGTACT CAG TCAACCCAAA CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 

6801 AAACATTGCA AACATGTGTC GTGGGATTCC AACATACATG GCTGAGGAGA 

6851 TGCAAGGTGA GTAGCATCCC TACTGTGCAC CCCAAGTTAG TGCTGGTGGG 

6901 ATTGTCAGAC TATCCTCGCG CGTGTCCATA GTGGGCACCA GTGATGCAGG 

6951 GATGGTCATC AAGGCCAACA TTTGTGCAGT GCTTGCTCTG TGCCAGGTAC 

7 001 TGTTCTATGT GCTTTAAGTG TGTTAACTCG GTTCTTCACA GCAATCTTAT 

7 051 AGGTTCTATT TTAATCCTAC TTTATGGATG AGGAAACTGA GGTACAGAGA 
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7101 GGTCACAAAA TCCTTGCCTG GGTCAATTCC AAGCATTTTG GCTGTGGATT 

7151 CTGTGCTCTT AAATATTATG GAACACTGCC TTTTAAGTGT GAATCAAGAG 

72 01 TAGACT CAAG TCATATTCAA AAGAATGCAT GAATGGCTAA ATGAAAGAAG 

7251 AATGCTAATA GAATCTATTA ACTTTCTATA GCTCAGACAA TCACTTAATT 

7301 TCTGGACATT CAAAGAACAG CTGCACACAA ACAAAGTGTC TACCTAGGGA 

7351 CCTAACTTAA TGGCAATTTT CCAGATCTCT GAATTGATTG ATTTCATCAC 

74 01 AACAAGTAGA TAAACCTTGA CATTAGCACA TAGCTAGTTT GGAAACCCCT 

74 51 ACTCCCCCAA TCCCCTCCAA GAAAAGAGTC CTTAAATAGA CATTAATATA 

7501 GGCTTCTTCT TTTCTCTTTA TTAGAGGCAA GCCTGTTTTT T TACT CAGGA 

7551 ACGTGCTACA CGACCAGTGT ACTATGGATT GTGGACATTT CCTTCTGTGG 

7 601 AGACACGGTG GAGAACTAAA CAATTTTTTA AAGCCACTAT GGATTTAGTC 

7 651 ATCTGAATAT GCTGTGCAGA AAAAAT AT G G GCTCCAGTGG TTTTTACCAT 

77 01 GTCATTCTGA AATTTTTCTC TACTAGTTAT GTTTGATTTC TTTAAGTTTC 

77 51 AATAAAATCA TTTAGCATTG AATTCAGTGT ATACTCACAT TTCTTACAAT 

78 01 TTCTTATGAC TTGGAATGCA CAGGATCAAA AATGCAATGT GGTGGTGGCA 
7851 AGTTGTTGAA GTGCATTAGA CTCAACTGCT AGCCTATATT CAAGACCTGT 
7 901 CTCCTGTAAA GAACCCCTTC AGGTGCTTCA GACACCACTA ACCACAACCC 
7 951 TGGGAATGGT TCCAATACTC TCCTACTCCT CTGTCCACTG CTTAA 
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1 CATGCTTGCC TACTCCTCTG TCCACTGCTT TCGTGAAGAC AAGATGAAGT 

51 TCACAATTGT CTTTGCTGGA CTTCTTGGAG TCTTTCTAGC TCCTGCCCTA 

101 GCTAACTATA ATATCAACGT CAATGATGAC AACAACAATG CTGGAAGTGG 

151 GCAGCAGTCA GTGAGTGTCA ACAATGAACA CAATGTGGCC AATGTTGACA 

201 ATAACAACGG ATGGGACTCC TGGAATTCCA TCTGGGATTA TGGAAATGGC 

251 TTTGCTGCAA CCAGACTCTT TCAAAAGAAG ACATGCATTG TGCACAAAAT 

301 GAACAAGGAA GTCATGCCCT CCATTCAATC CCTTGATGCA CTGGTCAAGG 

351 AAAAGAAGCT TCAGGGTAAG GGACCAGGAG GACCACCTCC CAAGGGCCTG 

401 ATGTACTCAG TCAACCCAAA CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 

451 AAACATTGCA AACATGTGTC GTGGGATTCC AACATACATG GCTGAGGAGA 

501 TGCAAGAGGC AAGCCTGTTT TTTTACTCAG GAACGTGCTA CACGACCAGT 

551 GT ACT AT GGA TTGTGGACAT TTCCTTCTGT GGAGACACGG TGGAGAACTA 

601 AACAATTTTT TAAAGCCACT ATGGATTTAG TCATCTGAAT ATGCTGTGCA 

651 GAAAAAATAT GGGCTCCAGT GGTTTTTACC ATGTCATTCT GAAATTTTTC 

7 01 TCTACTAGTT ATGTTTGATT TCTTTAAGTT TCAATAAAAT CATTTAGCAT 

751 TG 
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1 MKFTIVFAGLLGVFLAPALANYNIDVNDDNNNAGSGQQSVSVNNEHNVAN 50 

51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

101 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGIPTYMA 150 

151 EEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 185 
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1 GAATTCAAAC AGCAGGCCAT CTTTCACCAG CACTATCCGA ATCTAGCCAT 

51 ACCAGCATTC TAGAAGAGAT GCAGGCAGTG AGCTAAGCAT CAGACCCCTG 

101 CAGCCCTGTA AGCTCCAGAC CAT GGAGAAG AGGAAGGTTG TGGGTTCAAG 

151 GAGCTTTTCA GAGTGGAAAT CTGTGGATCA GTGATTTATA AAACACAGTT 

201 TCCCCCTTTA TTAGATTTGA ACCACCAGCT TCAGTTGTAG AAGAGAACAG 

251 GTTAAAAAAT AATAAGTGTC AGTCAGTTCT CCTTCAAAAC TATTTTAAAC 

301 GTTTACTTAT TTTGCCAAGT GACAGTCTCT GCTTCCTCTC CTAGGAGAAG 

351 TCTTCCCTTA TTTTAATATA AT ATT TGAAA GTTTTCATTA TCTAGAGCAG 

401 TGGTTCTCAT CCTGTGGGCC ATGAGCCCTT TGGGGGGGTT GAACGACCCT 

451 TTCACAGGGG TCACATATCA GATATCCTGC ATCTTAGCTA TTTACATTAT 

501 GATTCATAAC AGTAGCAAAA TTAGTTAGGA AGTAGGAACA AAATAACGTT 

551 ATGGTTGTGG TCACCACTAT GTTAGAGGGT CCGCAGCATT CAGAGGGTTG 

601 AGAACTGTTG TTCTAGAGGC AAATAAGAAG ACAGAGTTCC TTGATAGGGC 

651 CCAGAGGCAG TGAAAGAAGT TTCCACGTAG AAAGTGAAGA AGGTCTGGTG 

7 01 TCCGAAGCAG TGAGGAACTT AAAAAAAGAA AACCAAAAAC ATTGCCAACT 
751 AACAGTCCAG GAGAAGAGCG GGGCATGAAA GGCTGAGTTC CCATGGGATG 

8 01 CCTTGAATGG AATCAGAGTG TGGGAAAATT GGTGTGGCTG GAAGGCAGGT 
851 GCCGGGCATC TCAGACGCTG GTAGCTGGGG AAAC AG G AAA CCCCTTTAGG 
901 ATCCCAAGAT GCCATTCCAA TGAGCTTGAG ATTTTTCTCA TGGACTGCCA 
951 GTGAATGTTT CTACGCTCCG GAAATTAATG TTTACTTATT TTCCATATTC 
1001 TAGGGGAGAA CCCTGGGAAA AAT GGAGGAC ATTCATTGAA ATATCTGAGT 
1051 CCTGGGATAA GGCAGGCTTG GTCCTACAAC TCTGGTAAAA GTCCATCAGG 
1101 AAGTGCCTTG ACCAAGGCTG GAGTGGAGAG CTGTTGGTGA GATGTAAGGG 
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1151 CAAGGTTTAG TTGCTAGATA TGTAGATGGC AAGATGGTGC TGCCAACAGC 

1201 CCCCAGAGCT CTAACCCACT GAGAAACCCA GGAATGAATG ATGGGAGATG 

1251 GCTTTGGTGC CAGCTGCTAG TGACATGGCT GGAAAGCTGC ACTGGCTTCG 

1301 AGGCCAGACA ATTCCTCAAG GAAACATCTG GCCAGGGTGC AAGGGCCAGT 

13 51 TTCCTTCCTT GGAGTTCCTT TCACAGCTAA GAACATCAT C CCCCAACCAC 

14 01 TGGTTTTGTT AAAAAGTTTT CAGTATGACT TGAGCATGGT CAAGAAGCAT 
14 51 AGAGAGGGGG AAATAAGGGT GGAAGGAGCT GGAGAAAGCT TACAATAGGA 
1501 CTGGGTAAAG GGAAGGAGAA GAAACCATTC CCGCATTCCC ATAGGAGCCA 
1551 GTACCAGGAA GGGCAGGTGT ACACACAGAT CTCATCTAAG GCCATGTTTG 
1601 GTTTAGGGAT TACTCTTCTC CCGAATCTGA GCAGCAGCAA TACGTAAAAT 
1651 ACCCACACCC ATGGCTTCCA TATT CCAGAA CTTATCACAA ACCGTGTAGA 
1701 GTTTACTGAG ATACCTTCGT CAGAGGATGA GTCAGAGGCC TCCTGCCTAA 
17 51 GGGCCCTACT GAGCAGGCAG CTAAAGGCTT CCGGGCCTCT GCAGCTCCAC 
1801 AGATACAGGA GAGGGAAGCA GATAAGCCGT GGACTCCACC TGAGCACACC 
1851 TAGCTTGAGC AAAGCTGGTC AGGTACAAAT AGCAGAGGGC TGAATGTCTG 
1901 TGAGCACGCC GCCTGATCCT CTGCTCCACC ACACTCCTGC CGCCATGAAG 
1951 CTCACAGTAA GTCAGATCTT CTTTTCAATG CAGCACCATA CAACATTAAT 
2001 AGTCAGGGGT GAGGGGGTCT GACTCTTACG GCACTGTTAC CATAGTGGAA 
2051 ATATTCTCCT TTCTTTTCAT GGAATCATGG TGTTTACAAG CATGTCCATA 
2101 GAGAAGAAGA ATTGCCCCGG AAGAGCCTGT CACAGGCTGA ATACTGTAGA 
2151 ATTGTCTTTC ACACCATCTG TTCCAAGGTT CTACTTAAGA CGAGCAGTCT 
2201 CTGGGCTCCA GAAAGAGTCT TTCTTAGCCT TGATCTCTTT CTTATTTCTG 
22 51 ATTTCTCCTT TCTTATCCAT GATTTCCACT TTTACCAGTT CTGGGCATGT 
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2301 TCCGGTCAGA CTGGAAGATC ACTGTTGTCA AAACTAGTCT TCAACACTCT 

2351 TGGCTGTTAA CATGAAAACA ACGGTCCTTG GGCCCTGTGC AAGCATTTCT 

24 01 TGGAGAAAGT CTCTGGGGAT GAAGCTATCT CAGTTTCCCC ACTGAAGTCC 

24 51 TAGGATACAG AGGCTCAAAC AGAGTGCACA TATTCAATTT CAGCAT ACT C 

2501 TATTGGCGCT GCTTTATGAA T CAT AT GAAT TTATGGAATT GGAAATGTAA 

2551 ACTATGACCA AGAAGCGTCC ACCTCAGAAC AGGTTGGGTG GGGAACTCCA 

2601 AGCACAGGCC AGAGGGCTGC GTTTCTCTTC TAGTTCTGTC TAGAGGAGTG 

2 651 GTTCTCGACC TTCCTAATGC TGTGACCCTT TAATACAGTT CCTCACGTTG 

27 01 TCGTGACTCC CAGCCATAAA ATTACTTTCA TTGCTACTGC ATAACTGTAA 

27 51 TTTTGCTACC ATTATGAGTT GTAATGTAAA TATCTGATAT GCAAGATACC 

28 01 AGATAACCTA AGAAACGGTT GTTTGACCTT TAAAGGGGTC ACAACCCACA 
2851 GGTGGAGAAC TACTGGTCTA GGGTCCTTTA CAGTCCTTTA GCTGCCTCAT 
2 901 TTACAGGAGA TAACATCATG CTCAAAAACT CCCTCCACAT TTGGCTTTTT 
2 951 GGGTTGTTTT GTTTTGTTTT TCAAGACAGG GTTTCTCTGT GTAGCCCTGG 
3001 CTGTCCTGGA ACTCACCTTT GTAGACCAGG CTGGCCTCGA ACT CAGAAAT 
3051 CCGCCTGCTT CTGCCTCCTG AGCGCTGGGA TTAAAGGCGT GCGCCACCAT 
3101 GTCTGGCTCA CATCTGGCTT TTTAAGAGAC CGATTTTAAC TTCTTGCATT 
3151 GAAAATAAAT ATAGTAGAAA TGCTTAACCT ACTAAGACAA TAAAAACAGG 
3201 ATTCCTTCTG CTAGGAAGAA CACGTTCCAG ACTAAGGAAA AAAACCTTTT 
3251 CAGGGCTTTC ATTACACTGT GCCATGCACT AATTTTATGT TTTCTTCATC 
3301 AGTTTTCAGT GTCTGAAATT CAGTGTCAAA ATTCTAAGAC TACATATGAA 
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3351 TATCATTACA GTAACTCAGC AATTCTATGT TACCAGTAAG TTTTTCTGTA 

34 01 GTTTAAAAAA AAGGTGGAAG AAGAAAGCAC AGATAGTTTA GCACATGGGT 

34 51 AAAATCAGTA ACTATTTCTG ATGAGCTTGG TGAAGATGCT GTAAACCATG 

3501 CGACCACCAG TCCTGTTCTC TGTGCTTTCA GATGTTCGTC GTGGGTCTGC 

3551 TTGGCCTCCT TGCAGCTCCT GGTTTTGCTT ACGTAAGTCT CATTTTTCTG 

3 601 AAGTTCATTG TCAAAACTGC ATTTACAGTG AAATGTGATC TTAAGTCACC 

3 651 CTCTGCTTCT TATGAACATT AGACGGT CAA CATCAATGGT AATGATGGCA 
37 01 ATGTAGACGG AAGTGGACAG CATTCGGTGA GCATCAATGG TGTGCACAAC 
37 51 GTGGCCAATA TCGACAACAA TAACGGCTGG GACTCCTGGA ATAGCCTCTG 
3801 GGACTATGAA AACGTATGTA ATGGACACAC AGGGTAAAGA TATGGTGTAG 
3851 CCACCACCCA TTAAAATTTC TGAGGTGAAT TCTAGCTGTT CAT GAAC AT T 
3901 AAAAGCTACC AGTAAAAGTG CCCATTCCAC TCAAAACAAT TTTACTTTTT 
3951 TGCATATAAT TATTGCTAAT AAGT AT T AC A CAATAGGTCG AAATTCAAAG 
4001 GGAT CAAT AG TAAGGATAAA AACTATGTAC AAAGACAAAC ACAGCATCCT 
4051 TTGGTCTTCC CTGCAGAGAG TCTCCATGAT GTTAAAGGTC CAATGTTTTA 
4101 TGGAGGCTGA ATGAAATACG AATGCCTCTG T GAT GGAAAA GGCCCAACAT 
4151 CTTATGGAGA ATGAGTGAAG TATGAATGCT ATTAGTTGTA AGAGAAGGCG 

4 201 ATGCAAAGCA ACACTTGGCA CCACCTGCCA ATTACTACTT TCCTATTTAA 
4251 ATGTAGTTTA AAAAGCAAAG CCTGTCTTCC CTGCCTCCTG GAAACACTGC 
4301 GGATGGAGGT AGACCAAGGT ATGACAGCCT TTAAAAGTTT GTCAGCAAAA 
4351 CACTCCCCCA TACACACATA CACACACCCT CCTACTACAC TGGAACTGAA 
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4 4 01 GCAAAGGCAG TGGGTTAGAT ATATCCACCC TCTAAGAGTT TGCAGGTCAT 

4451 CTATATATGA TAGCCAGAGA CACAACTGCA GGACAGCCAG ACTCTGAGCA 

4501 CTCTCCCCAG CTCCTTGTAG CTCTGTTTCA GTGGTGACTT GTGACAAGAA 

4 551 TCCTGGGGAA CCTGTGCCTC ACTGTTCTCT GTCTTCTTTA ATAGAGTTTC 

4 601 GCTGCCACGA GACTCTTCTC CAAGAAGTCA TGCATTGTGC ACAGAATGAA 

4 651 CAAGGATGCC ATGCCCTCCC TTCAGGACCT CGATACAATG GTCAAGGAAC 

47 01 AGAAGGTAAA GTCCTGCCTT CTTCTTTGGA GTGACAGGAA GTCTTACAGT 

47 51 CTCCAGTACA CAGTGAAGTC ACCCCCATTC CCTCTTTGGT GGAGCAT GAC 

4801 AGCATGTTTG T CAT GAT AAA TGCCACAAAC ATGTAAAACT GTTCAGTGTC 

4851 TGCCTGAATG GAGGGTGGCT TCCACTGTGT CAGATGCCGT GGCCCACATC 

4 901 TGCCTCTGCA GGGTCCAGTA AAGCACTGGC TATCTTGAGT GTCAGAGACC 

4 951 CAAAGGTCTG TACACTTCAG TACAAGCCCT CCATATTTCA AGGGCACACT 

5001 CCTACAGTCG TTGGGGTTAT CAGAACTAGC AAACATAGAG ACTGGATTTT 

5051 CAGATGAAAA GAAATCCTTT TTAAAGTCTA AGTATGCCTT ATACAATGTT 

5101 TGAGATATTC T CAAT ACT AA AAAAAAAAAA ATTGTTGCTT GCTTGAAAAT 

5151 CAAATGTAAC CAAGTGTCCT ATATCCAGTG TCAATCATGG CTGTAGTAGA 

5201 TGGGAAGAGG GAGCCCGTGG TTTTCACAGT CAGACGCCTG AGTTATTCTT 

5251 CTAAGTGATA AATTGGTTCC TATAACAAGC AAGCCAGTGA ATATAAATAA 

5301 GCTCTATCTC AGAAGTTATC CTGTAGTGCT ACCCTAGAAT CTAAGAGAGC 

5351 AAAAGTGCTT CAAATTTCAG AATAAGTTTT GCTTTGGACT TCTGTTTTTC 

54 01 TAAACAACTA TAACTTCAAA CCATCTAAGC CTCGTGGGAC ACT T AGAAAT 

5451 ACCAAGCCAT T CAAAGCT AG AATTGTTTCT TCACCTTACT TGAAAACAAA 
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5501 ATGACAACCA AAAATTGTCC CCACTGCCCT TGTACATCTT CAGATCAGTA 

5551 AAGTCCTGGG CTCAGGGATC ATTCACTTTC TTTCTTTCCT TTCACACTCA 

5601 ACTTCAGGGT AAAGGGCCTG GAGGAGCTCC TCCCAAGGAC TTGATGTACT 

5651 CCGTCAACCC TACCAGAGTG GAGGACCTGA ATACATTCGG ACCAAAGATT 

57 01 GCTGGCATGT GCAGGGGCAT CCCTACCTAT GTGGCCGAGG AGATTCCAGG 

57 51 TGTGTACCCT GAGATGCTGT ATATCCCAAT GCAGT ACT GA GAGAGCCATC 

58 01 AGACACTCTA AAGTGTGACC ACAGACGGAC CAAT CATGTG GATTATCAGA 
5851 GCAAACACTT GCTTGCTCCT TGTCAGACAG TTGTCCATGC TTCAAAAGTT 
5901 CATTAAAAAA AATAGTTCAC AGGCTCCTCA CAGAAACCTT AGTAGAATCC 
5951 ACAGCTTCTG CTCTTAGTCT TACTTTTTAG AAACTGAGAC CCAGAGAAAG 
6001 GT CACAAAAC TTTTGTCTGG CTCAGGTTCT ATGTCTTTAA CTTTATAGAA 
6051 TACCGTCTTT CTGGGTGGGT GGGCTCTAGA GTAAACTTCA AGTGAGTTCA 
6101 AGGAAAGCAT GAGAAGTAGG GAAGACCAAA TGAAAGGAGA ATGCCAATGA 
6151 AATCTATCGA TTCTATAGCG CCAATGCTTA ACTCCTAGGC GTTCAAAGAA 
62 01 TAGTATCCAC AAGGTGTCAG CCTAAGATCC TAATCTAACA GCAAGTTTTC 
6251 AGATCTCTGA AGT GAAAAGA GAAAGCAAGA GAGGAACAGA GACAGAAACA 
6301 GTAAGAGACA GAGAGGCAGA GACAAAGAGA CAGGGAGAAT AGAGAGGGAT 
6351 TAAAATTAAT ATATAGTTTA GAAATTACGA CTCCTCACAG TCCCTGCAGA 
6401 GTCCTAGGAT AGGCACTGAT TTGGACTTCT TTTCTTCTCA CTAGGACCAA 
6451 ACCAGCCTTT GTACTCAAAG AAGTGCTACA CAGCTGACAT ACTCTGGATT 
6501 CTGCGGATGT CCTTCTGT GG AACATCAGTG GAGACATACT AGAAGTCACA 
6551 GGAAAACAAC CCGTGGGCTC TGACCATCGC AATGCTTGAT TATGAGAGTG 
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6601 TTCTCTGGGG GTTGTGATTA 

6651 CAGCACATCC AGTTTGTAAT 

67 01 ATGTGGCACC TGCCAGCCTG 

67 51 CTCCAGAGTT TTCAAATACT 

68 01 AGGCCACTGC TGCCTGACTA 
6851 AAAAAAGAGC TCAGAGTGGC 
6901 GAGCCAGAAG TATCAATAAT 
6951 ACTAGAACAC ACGCCTGCAG 
7 001 CCCATAACTG CACACCCCCT 
7051 AAGATGGCCC TTGCTTTTTT 
7101 AGTCAACCTG ACTCTCTACA 
7151 ATTTACTGGT GAATGGCACT 
7201 GGATCCATTC ATTGACAGGC 
7 251 ATGGGTCTTT CCTTGAAGGA 



GCTTCTTTAA GGCTCAATAA ACCCACGTGG 
GACATGCCTC ATGACTTCTA TGGGAGTCCA 
TATTCAGGAC CTCTCCGCTA TAAAGCATCC 
ACAAAGCACA GCCTGGGTTT GGGCTCAGAT 
CATTACAGAC AAACAAGTTT TAAAAGAAAG 
TGGAATCAGC AAGGGTGTTT TTCCTGCAAG 
CACCCAAGGA GGAGACACTG GGAATGAGAG 
ATACGGAGAA CCTCAGCATT GCCGCTCTCT 
TCTGTAAACT CTGCTTCTTT CTTTCACCTG 
TTATTATAGG ACANGATAAC TAGACCAGAA 
TTTATATGTC TTCCCAGNTC AAGAAATATT 
TCTATATTCC CTTGGTTCAA TAAGTCTACA 
CAAGAGTGAG AT C AC AT GAT ACCCAAGCAC 
GAAGGATCCA 
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1 ATGTTCGTCGTGGGTCTGCTTGGCCTCCTTGCAGCTCCTGGTTTTGCTTACACGGTCAAC 

61 ATCAATGGTAATGATGGCAATGTAGACGGAAGTGGACAGCATTCGGTGAGCATCAATGGT 

121 GT GCAC AACGT GGC CAATAT CGACAACAAT AACGGCT GGGACTCCT GGAAT AGCCTCTGG 

181 GACTATGAAAACAGTTTCGCTGCCACGAGACTCTTCTCCAAGAAGTCATGCATTGTGCAC 

241 AGAATGAACAAGGATGCCATGCCCTCCCTTCAGGACCTCGATACAATGGTCAAGGAACAG 

301 AAGGGTAAAGGGCCTGGAGGAGCTCCTCCCAAGGACTTGATGTACTCCGTCAACCCTACC 

361 AGAGTGGAGGACCTGAATACATTCGGACCAAAGATTGCTGGCATGTGCAGGGGCATCCCT 

4 41 ACCTATGTGGCCGAGGAGATTCCAGGACCAAACCAGCCTTTGTACTCAAAGAAGTGCTAC 

501 ACAGCTGACATACTCTGGATTCTGCGGATGTCCTTTTGTGGAACATCAGTGGAGACATAC 

561 TAG 
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1 MKLTMFWGL LGLLAAPGFA YTVNINGNDG NVDGSGQQSV SINGVHNVAN 

51 IDNNNGWDSW NSLWDYENSF AATRLFSKKS CIVHRMNKDA MPSLQDLDTM 

101 VKEQKGKGPG GAPPKDLMYS VNPTRVEDLN TFGPKIAGMC RGIPTYVAEE 

151 IPGPNQPLYS KKCYTADILW ILRMSFCGTS VETY 
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1 atgcctgact tctcacttca ttgcattggt gaagccaaga tgaagttcac 

51 aattgccttt gctggacttc ttggtgtctt cctgactcct gcccttgctg 

101 actatagtat cagtgtcaac gacgacggca acagtggtgg aagtgggcag 

151 cagtcagtga gtgtcaacaa tgaacacaac gtggccaacg ttgacaataa 

201 caatggatgg aactcctgga atgccctctg ggactataga actggctttg 

251 ctgtaaccag actcttcgag aagaagtcat gcattgtgca caaaatgaag 

301 aaggaagcca tgccctccct tcaagccctt gatgcgctgg tcaaggaaaa 

351 gaagcttcag ggtaagggcc cagggggacc acctcccaag agcctgaggt 

401 actcagtcaa ccccaacaga gtcgacaacc tggacaagtt tggaaaatcc 

451 atcgttgcca tgtgcaaggg gattccaaca tacatggctg aagagattca 

501 aggagcaaac ctgatttcgt actcagaaaa gtgcatcagt gccaatatac 

551 tctggattct taacatttcc ttctgtggag gaatagcgga gaactaa 
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1 MKFTIAFAGL LGVFLTPALA DYSISVNDDG NSGGSGQQSV SVNWEHNVAN 

51 VDNNNGWNSW NALWDYRTGF AVTRLFEKKS C I VHKMKKE A MPSLQALDAL 

101 VKEKKLQGKG PGGPPPKSLR YSVNPNRVDN LDKFGKSIVA MCKGI PTYMA 

151 EEIQGANLIS YSEKCISANI LWILNISFCG GIAEN 
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1 MKFTIAFAGL LGVFLTPALA DYSISVNDDG NSGGSGQQSV SVNNEHNVAN 

51 VDNNNGWNSW NALWDYRTGF AVTRLFEKKS CIVHKMKKEA MPSLQALDAL 

101 VKEKKLQGKG PGGPPPKSLR YSVNPNRVDN LDKFGKSIVA MCKGIPTYMA 

151 EEIQGANLIS YSEKCISANI LWILNISFCG GIAEN 
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1 MKFT I VFAGLLGVFLAPALAN YN I DVN DDNNNAG S GQQ S V S VNNEHNVAN 50 

1 MKFTIAFAGLLGVFLTPALADYSISVNDDGNSGGSGQQSVSVNNEHNVAN 50 

51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

51 V DNNNGWN S WN AL W S YRT G FAVTRLFRKK SCI VHKMKKE AM P S L Q AL DAL 100 

101 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGIPTYMA 150 

101 VKEKKLQGKGPGGPPPKSLRYSVNPNRVDNLDKFGKSIVAMCKGIPTYMA 150 

151 EEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 18 5 

151 EEIQGANLISYSEKCISANILWILNISFCGGIAEN 185 



FIG. 10 



Human MKFTIVF. AG LLGVFLAPAL 

Pig MKFTIAF. AG LLGVFLT PAL 

Mouse MKLTM.FWG LLGLLAAPGF 

51 

Human ANVDNNNGWD SWNSIWDYGN 

Pig ANVDNNNGWN SWNALWDYRT 

Mouse AN I DNNNGWD SWNSLWDYEN 

101 

Human ALVKEKKLQG KGPGGPPPKG 

Pig ALVKEKKLQG KGPGGPPPKS 

Mouse TMVKEQK . . G KGPGGAPPKD 

151 

Human MAEEMQEASL FFYSGTCYTT 

Pig MAEEIQGANL ISYSEKCISA 

Mouse VAEEIPGPNQ PLYSKKCYTA 



50 

ANYNIDVN.D DNNNAGSGQQ SVSVNNEHNV 
ADYSISVN.D DGNSGGSGQQ SVSVNNEHNV 
A. YTVNINGN DGNVDGSGQQ SVSINGVHNV 

100 

GFAATRLFQK KTCIVHKMNK EVMPSIQSLD 
GFAVTRLFEK KSCIVHKMKK EAMPSLQALD 
SFAATRLFSK KSCIVHRMNK DAMPSLQDLD 

150 

LMYSVNPNKV DDLSKFGKNI ANMCRGIPTY 
LRYSVNPNRV DNLDKFGKSI VAMCKGIPTY 
LMYSVNPTRV EDLNTFGPKI AGMCRGIPTY 

188 

SVLWIVDISF CGDTVEN 
NILWILNISF CGGIAEN 
DILWILRMSF CGTSVETY 
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